Separation of speech from interfering sounds based on oscillatory correlation

نویسندگان

DeLiang Wang

Guy J. Brown

چکیده

A multistage neural model is proposed for an auditory scene analysis task--segregating speech from interfering sound sources. The core of the model is a two-layer oscillator network that performs stream segregation on the basis of oscillatory correlation. In the oscillatory correlation framework, a stream is represented by a population of synchronized relaxation oscillators, each of which corresponds to an auditory feature, and different streams are represented by desynchronized oscillator populations. Lateral connections between oscillators encode harmonicity, and proximity in frequency and time. Prior to the oscillator network are a model of the auditory periphery and a stage in which mid-level auditory representations are formed. The model has been systematically evaluated using a corpus of voiced speech mixed with interfering sounds, and produces improvements in terms of signal-to-noise ratio for every mixture. The performance of our model is compared with other studies on computational auditory scene analysis. A number of issues including biological plausibility and real-time implementation are also discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The separation of speech from interfering sounds: an oscillatory correlation approach

A neural model is described which uses oscillatory correlation to segregate speech from interfering sound sources. The core of the model is a two-layer neural oscillator network. The first layer of the network identifies connected regions of energy in the time-frequency plane (segments). In the second layer, segments that have a common fundamental frequency are grouped into streams. A stream is...

متن کامل

An Oscillatory Correlation Frame work for Computational Auditory Scene Analysis

A neural model is described which uses oscillatory correlation to segregate speech from interfering sound sources. The core of the model is a two-layer neural oscillator network. A sound stream is represented by a synchronized population of oscillators, and different streams are represented by desynchronized oscillator populations. The model has been evaluated using a corpus of speech mixed wit...

متن کامل

A comparison of auditory and blind separation techniques for speech segregation

A fundamental problem in auditory and speech processing is the segregation of speech from concurrent sounds. This problem has been a focus of study in computational auditory scene analysis (CASA), and it has also been recently investigated from the perspective of blind source separation. Using a standard corpus of voiced speech mixed with interfering sounds, we report a comparison between CASA ...

متن کامل

Sparse Separation of Under-Determined Speech Mixtures

We are all familiar with the shape of sound from our secondary school science classes; the typical oscillatory form of a string under tension that decays over time is widely know. At first sight, this representation of sound imparts to the observer nothing more than its duration and amplitude. So how does the brain separate different sounds given such a representation? Over millions of years th...

متن کامل

Speech Enhancement from Interfering Sounds Using Casa Techniques and Blind Source Separation

In this paper we propose novel biologically plausible model for segregation of one dominant speaker from the other concurrent speakers and environmental noise in real cocktailparty scenario. The developed method integrates two powerful techniques: computational scene analysis (CASA) and blind source separation (BSS) technique with bandpass preprocessing. Since each of these techniques applied a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

IEEE transactions on neural networks

دوره 10 3 شماره

صفحات -

تاریخ انتشار 1999

Separation of speech from interfering sounds based on oscillatory correlation

نویسندگان

چکیده

منابع مشابه

The separation of speech from interfering sounds: an oscillatory correlation approach

An Oscillatory Correlation Frame work for Computational Auditory Scene Analysis

A comparison of auditory and blind separation techniques for speech segregation

Sparse Separation of Under-Determined Speech Mixtures

Speech Enhancement from Interfering Sounds Using Casa Techniques and Blind Source Separation

عنوان ژورنال:

اشتراک گذاری